RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 0$ /QSSj 25*1 4 

Source: 'CW/ 

Date Processed by STIC: 7- "3~cru 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin2 1 help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Revised 01/29/2002 



Raw Sequence Listing Error Summary . 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS. WHICH WERE INSERTED BY PTO SOFTWARE 



1 



^Wrapped Nucleics The numberAext at the end of each line "wrapped" down to the next line. This may occur ifyourfile 
Wrapped Aminos • was retrieved in a word processor after creating iL Please adjust your right margin to .3; this will 
prevent •'wrapping/* 

Jnvalid Line Length The rules require that a line not exceed 72 characters in length. This includes while spaces. 

Misaligned Amino The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 



Numbering 
A Non-ASCII 

5 Variable Length 



The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is saved in ASCII text. 

Sequcncc(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

_PatentIn 2.0 A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

"bug" sequencers) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

_Skippcd Sequences Sequencers) missing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ED NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



8 Skipped Sequences Scqucncc(s) 

(NEW RULES) <2 1 0> sequence id number 
<400> sequence id number 
000 

JJsc of n's or Xaa's Use of n's and/or Xaa's have been detected in the Sequence Listing. 



(NEW RULES) 

10 ^ X Lwalid<213> 
Response 



Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n*s or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Ojjjcnown or 
is Artificial Sequence 

3 



Sequences) J 



m missing the <220> "Feature" and associated numeric identifiers and responses. 



12 



Patentln 2.0 



"bug- 



Use of <220>O fljC223> is M A NDATORY if <213> "Organism" response is "Artifi cial Sequence" or 
"Unknown." ^ Please explain source of genetic material in <220> to <223> section) 
(See "Federal Register," 06A) 1/1998, Vol. 63, No. I W, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to .floppy disk. 



AMC - Biotechnology Systems Branch - 06704/2001 



Page 1 of 7 




OIPE 



RAW SEQUENCE LISTING DATE: 07/03/2002 

PATENT APPLICATION: US/09/955 , 259A TIME: 14:15:48 

Input Set : A:\52071-4.ST25.txt 

Output Set: N:\CRF3\07032002\I955259A.raw 

3 <110> APPLICANT: Annibali, Nestor 

5 <120> TITLE OF INVENTION: Expression of a Human Insulin Precursor In P. Pastoris 
7 <130> FILE REFERENCE: 52071.4 

9 <140> CURRENT APPLICATION NUMBER: US 09/955 ,259A 
10 <141> CURRENT FILING DATE: 2001-09-12 
12 <160> NUMBER OF SEQ ID NOS : 26 
14 <170> SOFTWARE: Patentln version 3.1 

16 <210> SEQ ID NO: 1 

17 <211> LENGTH: 36 

18 <212> TYPE: DNA 

19 <213> ORGANISM: Artificial Sequence 

21 <220> FEATURE: 

22 <223> OTHER INFORMATION: Synthetic Primer 

24 <400> SEQUENCE: 1 

25 tcacacctgg tggaagctct ctacctagtg tgcggg 36 

28 <210> SEQ ID NO: 2 

29 <211> LENGTH: 45 

30 <212> TYPE: DNA 

31 <213> ORGANISM: Artificial Sequence 

33 <220> FEATURE: 

34 <223> OTHER INFORMATION: Synthetic Primer 

36 <400> SEQUENCE: 2 

37 ggtcttgggt gtgtagaaga agcctcgttc cccgcacact aggta 4 5 

40 <210> SEQ ID NO: 3 » 

41 <211> LENGTH: 39 / j/ / 0 £ // 

42 <212> TYPE: DNA C-C < ' 

43 <213> ORGANISM: Artificial Sequence ^ ^ ^ PjCfto/^- 

45 <220> FEATURE: ■ - " 1 

46 <223> OTHER INFORMATION: /gctggtacag cattgttcca caatgccacg cttggtcttg ggtgt) O. ^^ / * / °] 
48 <400> SEQUENCE: 3 ^ — ; — * ' / 

4 9 tttgtgaacc aacacctgtg cggctcacac ctggtggaa 39 Sl/^J^\ 

52 <210> SEQ ID NO: 4 

53 <211> LENGTH: 45 

54 <212> TYPE: DNA 

55 <213> ORGANISM: Artificial Sequence 

57 <220> FEATURE: 

58 <223> OTHER INFORMATION: Synthetic Primer 

60 <400> SEQUENCE: 4 

61 gctggtacag cattgttcca caatgccacg cttggtcttg ggtgt 45 

64 <210> SEQ ID NO: 5 

65 <211> LENGTH: 52 

66 <212> TYPE: DNA 

67 <213> ORGANISM: Artificial Sequence 



file://C:\Crf3\Outhold\VsrI955259A.htm 
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RAW SEQUENCE LISTING DATE: 07/03/2002 

PATENT APPLICATION: US/09/955 , 259A TIME: 14:15:48 

Input Set : A:\52071-4.ST25.txt 

Output Set: N:\CRF3\07032002\l955259A.raw 

69 <220> FEATURE : 

70 <223> OTHER INFORMATION: Synthetic Primer 

72 <400> SEQUENCE: 5 

73 ctagttgcag tagttctcca gctggtagag ggagcagatg ctggtacagc at 52 

76 <210> SEQ ID NO: 6 

77 <211> LENGTH: 162 

78 <212> TYPE: DNA 

79 <213> ORGANISM: Artificial Sequence 

81 <220> FEATURE,: 

82 <223> OTHER INFORMATION: complete synthetic insulin precursor obtained by PCR using 

human 

83 insulin sequence as original source 

85 <400> SEQUENCE: 6 

86 tttgtgaacc aacacctgtg cggctcacac ctggtggaag ctctctacct agtgtgcggg 60 
88 gaacgaggct tcttctacac acccaagacc aagcgtggca ttgtggaaca atgctgtacc 120 
90 agcatctgct ccctctacca gctggagaac tactgcaact ag 162 

93 <210> SEQ ID NO: 7 

94 <211> LENGTH: 50 

95 <212> TYPE: DNA 

96 <213> ORGANISM: Artificial Sequence 

98 <220> FEATURE: 

99 <223> OTHER INFORMATION: Synthetic Primer 

101 <400> SEQUENCE: 7 

102 acttggttga agctttgtac ttggtttgtg gtgaaagagg tttcttctac 50 

105 <210> SEQ ID NO: 8 

106 <211> LENGTH: 50 

107 <212> TYPE: DNA 

108 <213> ORGANISM: Artificial Sequence 

110 <220> FEATURE: 

111 <223> OTHER INFORMATION: Synthetic Primer 

113 <400> SEQUENCE: 8 

114 agaagtacaa cattgttcaa cgatacctct cttagtcttt ggagtgtaga 50 

117 <210> SEQ ID NO: 9 

118 <211> LENGTH: 33 

119 <212> TYPE: DNA 

120 <213> ORGANISM: Artificial Sequence 

122 <220> FEATURE: 

123 <223> OTHER INFORMATION: Synthetic Primer 

125 <400> SEQUENCE: 9 

126 acacttgtgt ggttctcact tggttgaagc ttt 33 

129 <210> SEQ ID NO: 10 

130 <211> LENGTH: 66 

131 <212> TYPE: DNA 

132 <213> ORGANISM: Artificial Sequence 

134 <220> FEATURE: 

135 <223> OTHER INFORMATION: Synthetic Primer 

137 <400> SEQUENCE: 10 

138 ttactcgagt tagttacagt agttttccaa ttggtacaaa gaacagatag aagtacaaca 60 
140 ttgttc 66 
143 <210> SEQ ID NO: 11 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/955 , 259A 



DATE: 07/03/2002 
TIME: 14:15:48 



Input Set : A:\52071-4.ST25.txt 

Output Set: N:\CRF3\07032002\l955259A.raw 



144 <211> LENGTH: 36 

145 <212> TYPE: DNA 

146 <213> ORGANISM: Artificial Sequence 

148 <220> FEATURE: 

149 <223> OTHER INFORMATION: Synthetic Primer 

151 <400> SEQUENCE: 11 

152 ccgctcgaga agagatttgt taaccaacac ttgtgt 

155 <210> SEQ ID NO: 12 

156 <211> LENGTH: 162 

157 <212> TYPE: DNA 

158 <213> ORGANISM: Artificial Sequence 

160 <220> FEATURE: 

161 <223> OTHER INFORMATION: synthetic insulin precursor, obtained by 



164 <400> SEQUENCE: 12 

165 tttgttaacc aacacttgtg tggttctcac ttggttgaag ctttgtactt ggtttgtggt 
167 gaaagaggtt tcttctacac tccaaagact aagagaggta tcgttgaaca atgttgtact 
169 tctatctgtt ctttgtacca attggaaaac tactgtaact aa 

172 <210> SEQ ID NO: 13 

173 <211> LENGTH: 56 

174 <212> TYPE: DNA 

175 <213> ORGANISM: Artificial Sequence 

177 <220> FEATURE: 

178 <223> OTHER INFORMATION: Synthetic Primer 

180 <400> SEQUENCE: 13 

181 cgcggatcca aaccatgaga ttcccatcta tcttcactgc tgttttgttc gctgct 

184 <210> SEQ ID NO: 14 

185 <211> LENGTH: 68 

186 <212> TYPE: DNA 

187 <213> ORGANISM: Artificial Sequence 

189 <220> FEATURE: 

190 <223> OTHER INFORMATION: Synthetic Primer 

192 <400> SEQUENCE: 14 

193 gttttgttcg ctgcttcttc tgctttggct gctcctgtta acactactac tgaagacgaa 
195 actgctca 

198 <210> SEQ ID NO: 15 

199 <211> LENGTH: 71 

200 <212> TYPE: DNA 

201 <213> ORGANISM: Artificial Sequence 

203 <220> FEATURE: 

204 <223> OTHER INFORMATION: Synthetic Primer 

206 <400> SEQUENCE: 15 

207 acgtcgaagt caccttccaa gtcagagtaa ccgataaccg cttcagctgg gatttgagca 
209 gtttcgtctt c 

212 <210> SEQ ID NO: 16 

213 <211> LENGTH: 66 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Artificial Sequence 
217 <220> FEATURE: 



60 
120 
162 



56 



60 
68 



60 
71 



insulin 
162 



sequence as original source 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/955 f 259A 



DATE: 07/03/2002 
TIME: 14:15:48 



Input Set : A:\52071-4.ST25.txt 

Output Set: N:\CRF3\07032002\I955259A.raw 



218 <223> OTHER INFORMATION: Synthetic Primer 

220 <400> SEQUENCE: 16 

221 gatgaacaac aaaccattat tagtagagtt agagaaaggc aaaacagcaa cgtcgaagtc 
223 accttc 

226 <210> SEQ ID NO: 17 

227 <211> LENGTH: 72 

228 <212> TYPE: DNA 

229 <213> ORGANISM: Artificial Sequence 

231 <220> FEATURE: 

232 <223> OTHER INFORMATION: Synthetic Primer 

234 <400> SEQUENCE: 17 

235 ccgctcgaga gaaacaccct cttccttagc agcgatagaa gcgatagtag tgttgatgaa 
237 caacaaacca tt 

240 <210> SEQ ID NO: 18 

241 <211> LENGTH: 267 

242 <212> TYPE: DNA 

243 <213> ORGANISM: Artificial Sequence 
245 <220> FEATURE: 

24 6 <223> OTHER INFORMATION: synthetic sequence of alpha factor from S. 



249 <400> SEQUENCE: 18 

250 atgagattcc catctatctt cactgctgtt ttgttcgctg cttcttctgc tttggctgct 
252 cctgttaaca ctactactga agacgaaact gctcaaatcc cagctgaagc ggttatcggt 
254 tactctgact tggaaggtga cttcgacgtt gctgttttgc ctttctctaa ctctactaat 
256 aatggtttgt tgttcatcaa cactactatc gcttctatcg ctgctaagga agagggtgtt 
258 tctctcgaga agagagaggc tgaagca 

261 <210> SEQ ID NO: 19 

262 <211> LENGTH: 44 

263 <212> TYPE: DNA 

264 <213> ORGANISM: Artificial Sequence 

266 <220> FEATURE: 

267 <223> OTHER INFORMATION: Synthetic Primer 

269 <400> SEQUENCE: 19 

270 ggggatccat atgctcgaga aaagatttgt gaaccaacac ctgt 

273 <210> SEQ ID NO: 20 

274 <211> LENGTH: 32 

275 <212> TYPE: DNA 

276 <213> ORGANISM: Artificial Sequence 

278 <220> FEATURE: 

279 <223> OTHER INFORMATION: Synthetic Primer 

281 <400> SEQUENCE: 20 

282 ttagaattcc cgggtctagt tgcagtagtt ct 

285 <210> SEQ ID NO: 21 

286 <211> LENGTH: 30 

287 <212> TYPE: DNA 

288 <213> ORGANISM: Artificial Sequence 

290 <220> FEATURE: 

291 <223> OTHER INFORMATION: Synthetic Primer 
293 <400> SEQUENCE: 21 



60 
66 



60 
72 



cerevisiae, 



60 
120 
180 
240 
267 



44 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/955 , 259A 



DATE: 07/03/2002 
TIME: 14:15:48 



Input Set : A:\52071-4.ST25.txt 

Output Set: N:\CRF3\07032002\l955259A.raw 



294 



tcactcgagc ggtctagttg cagtagttct 



30 



297 <210> SEQ ID NO: 22 

298 <211> LENGTH: 28 

299 <212> TYPE: DNA 

300 <213> ORGANISM: Artificial Sequence 

302 <220> FEATURE: 

303 <223> OTHER INFORMATION: Synthetic Primer 

305 <400> SEQUENCE: 22 

306 gtcgtggttt ctcatagtag agtggaca 28 

309 <210> SEQ ID NO: 23 

310 <211> LENGTH: 18 

311 <212> TYPE: DNA 

312 <213> ORGANISM: Artificial Sequence 

314 <220> FEATURE: 

315 <223> OTHER INFORMATION: Synthetic Primer 

317 <400> SEQUENCE: 23 

318 ggtcatcact gctccatc 18 

321 <210> SEQ ID NO: 24 

322 <211> LENGTH: 19 

323 <212> TYPE: DNA 

324 <213> ORGANISM: Artificial Sequence 

326 <220> FEATURE: 

327 <223> OTHER INFORMATION: Synthetic Primer 

329 <400> SEQUENCE: 24 

330 agcagcacca gtggaagat 19 

333 <210> SEQ ID NO: 25 

334 <211> LENGTH: 21 

335 <212> TYPE: DNA 

336 <213> ORGANISM: Artificial Sequence 

338 <220> FEATURE: 

339 <223> OTHER INFORMATION: Synthetic Primer 

341 <400> SEQUENCE: 25 

342 gactggttcc aattgacaag c 21 

345 <210> SEQ ID NO: 26 

346 <211> LENGTH: 4 

347 <212> TYPE: PRT 

348 <213> ORGANISM: Saccharomyces cerevisiae 
350 <400> SEQUENCE: 26 

352 Lys Arg Glu Ala 

353 1 
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VERIFICATION SUMMARY DATE: 07/03/2002 

PATENT APPLICATION: US/09/955 , 2 59A TIME: 14:15:49 

Input Set : A:\52071-4.ST25.txt 

Output Set: N:\CRF3\07032002\I955259A.raw 
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